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A leader election algorithm is an elimination process that divides recursively into tow subgroups an initial group of n 
items, eliminates one subgroup and continues the procedure until a subgroup is of size 1. In this paper the biased case 
is analyzed. We are interested in the cost of the algorithm, i.e. the number of operations needed until the algorithm 
stops. Using a probabilistic approach, the asymptotic behavior of the algorithm is shown to be related to the behavior 
of a hitting time of two random sequences on [0, 1]. 
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1 Introduction 

A single-hop network is a distributed system of n nodes, also called stations, sharing a common com- 
munication channel which can transmit only one message per time unit. In the special case of collision 
detection, the channel is ternary feedback; each station sending a message to the network can simultane- 
ously listen to the channel and detect: a collision when at least there are two broadcast attempts, a silence 
when no station sends message, or a success when exactly one station sends its message. A single-hop 
network with collision detection is called multiple access channel. 

Consider a multiple access channel of n stations which has to elect a leader to control and organize the 
network. Because of links or stations failures, the leader may be temporarily out of service. Such failure 
can be detected by a silence, in which case the system stops normal operations and initiates the election 
process: the system has to identify a new leader in a reasonable execution time. We are interested in the 
cost of the algorithm, i.e. the number of operations needed to find a leader. 

1. 1 Leader election problem 

We assume that the size n of the multiple access channel is unknown. Moreover, each station is assumed 
to have a unique identifying number ID. To elect a leader among themselves, stations have to use the same 
algorithm. The case n e {0, 1} is trivial, n is assumed to be greater than 2. Let us recall the basic one: 

— Deterministic Initialization: At the first time unit, each station send a message with its ID number 
to the common channel. As n > 2, all stations detect a collision. 

— Randomized Selection Process: Each station S generates independently a Bernoulli random vari- 
able Bs with parameter p. Only which obtains Bs = 1 is allowed to send its message again during 
the next time unit. 

For a station S, there are two cases: 

1. If Bs — 1, station S will be called Acf/ve; S sends again its message to the channel and can detect 

— a success; only station S is trying transmission, then all the other stations receive its ID's 
message and S obtains the status of leader. The protocol is finished. 

— a collision; station S is not the only candidate to be leader, and so has to generate Bs again. 
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2. Otherwise, station S becomes Non Active; it remains candidate to be leader, listens to the channel 
but does not participate to the transmission. So it can detect 

— a success; only one station S' ^ S is trying transmission. The other stations (including S) 
detect its ID. So S' obtains the status of leader. The protocol is finished. 

— a collision; although station S is not participating to the selection process, there are at least 2 
Active stations. So, station S is eliminated. 

— a silence; all stations are Non Active, so station S has to generate Bs again to send or not its 
id's message to the channel. 

That is, at the end of the protocol, a single station remains Active and becomes the leader of the system. 

This splitting process using a Bernoulli random variable was also used in the tree protocol of Capetanakis 
and Tsybakhov. For a survey, see Mathys and Flajolet (1985). 

The example below illustrates the election process appHed to a group of 4 stations {A, B, C, D}. In this 
case, the leader A is elected in 4 times units. 



time units 
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3 
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Non Active Stations 
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BC 


Eliminated Stations 
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Channel feedback 


Collision 


Collision 
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Fig. 1: Election of the leader A; H4 — 4. Incomplete tree structure. 



Definition 1 (Algorithm Cost) It is the number of rounds needed to find a leader. Denote by Hn the 
algorithm cost when the size of the network is n. 

Such a randomized elimination algorithm has various applications in distributed systems like cellular 
phones and wireless communication networks. In mobile Ad-hoc networks, failures occur when mobile 
nodes move out of transmission range. The unstable topology of the network makes leader election 
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problem more complex. For more details, see Malpani et al. (2000). Electing a leader in a computer 



network is fundamental to supervise communication and synchronization. See Fill et al. (1996). It is also 



studied in a context of radio networks. For an interesting survey on randomized communication in this 



collision detection, see Lavault et al. (2003) 



context, see Chlebus (2001 ). For more elaborate leader election algorithms on radio network with no 



1.2 Splitting process and tree structure 

Formally, the algorithm starts with a group of n items which is divided in two subgroups. The probability 
that an item is sent into the left subgroup is p. This subgroup will be divided by the same process. The 
other items will be ignored. If the left subgroup is empty, the algorithm restarts from the previous level. 

This distributed algorithm is a randomized elimination process with a natural binary tree structure (Fig.|l]). 
At the root of the associated tree, is the initial group of items. In the first split, it generates two nodes: the 
left one will be split by the same process, the right one is a terminal node, also called leaf, which will not 
be treated by the algorithm except when the left node is empty. Only in this case, the right node will be 
spUt into two one. 

Thus, this tree structure can be represented as an incomplete tree in which only one side is developed. We 
define the height of the associated tree a s the length of th e path from the root to the leader which is the 
longest root-to-leaf path in the tree (see Fill et al. (1996) ). Then the algorithm cost is equivalently the 
height of the associated tree. Fig. |] illustrates this equality. 

1.3 Previous works 

It is known that the avera ge cost of the le ader election algorithm is of logari thmic order in n with an 
oscilla ting behavior. See Prodinger ( 1993 ) for the unbiased case p = 1/2, [fanson and Szpankowski 
( |1997| ) for the biased one p ^ 1 /2. 



Consider the Poisson model of the leader election problem, i.e. the election process applied to a network 



with random size following a Poisson process Af^ (see 1.6). Let h the Poisson transform of the sequence 
of average cost of the algorithm (E(i7„))jj>Q. 

Definition 2 (Poisson transform) For x > 0, the Poisson transform of the sequence (E(i?„)) is the func- 
tion h defined by 



h{x) = E{H^^ 



11=2 



Then, function h is solution of a functional equation, called basic functional equation associated to the 
algorithm 

h{x) ^ h{px) + h{qx) e''"' -\- f{x), wherep + g = l (1) 
and / is a given function. Equation (jl]) is the starting point of all studies made on this algorithm. 

Tine unbiased case 

When the splitting process follows a Bernoulli random variable of parameter 1/2, the leader election 
algorithm is called symmetrical. Observe that, for the unbiased case, the functional equation (??) is 
solved by direct iteration. In fact, the Poisson transform h verifies 



h{x) = h{x/2) ( 



1 



,-x/2 
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which can be rewritten g{x) — g{x/2) + f{x)/{l — e ^) where g{x) — h{x)/ (1 — e 



The first analysis of the leader election algorithm was proposed by [Prodingeij ( |1993| ). He investigated 
different parameters of interest such as the height, called depth in his paper, the size of the associated 

tree, i.e. the number of nodes Using combinatorial techniques, he established exact expressions and 

asymptotic formulas for these quantities for the symmetrical case. So, it is shown that for an initial group 
of size n, the algorithm stops on avera ge after ab o ut log ^ n steps. Using complex analysis techniques like 
Mellin and inverse Mellin transform. Fill et al. ( 1996[ ) studied the asymptotic behavior of the first two 
moments of the algorithm cost. Moreover, they obtained the exact expression and asymptotic behavior 
of the distribution of i7„ and they have shown that a limit distribut ion for the centered algorithm cost 
Hn — \}og2 n\ does not exist. For a survey on Melhn transform, see Flajolet et al. (1995). 



The biased case 

If the splitting process is biased, i.e. the probability that an item is sent into the left subgroup is p ^ 
1/2, the algorithm is called asymmetrical. Studies on biased case become more rare. An asymmetric 
leader election algorithm was investigated by [lanson and Szpankowski (1997) using complex analysis 
techniques. The asymptotic behavior of the first two moments of the algorithm cost iJ„ is given in term 
of the sequence of their exact values (E(_ffj ))^gpj computed numerically from two recurrence equations. 

This implicit dependence is due to the asymmetry of the functional equation (??) obtained by Poissoniza- 
tion. The coefficient e^P^ makes more complex the establishment of an iterative scheme such as in the 
context of a protocol for a multi-access broadcast channel (see Fayolle et al. (1986)). Applying the Mellin 
transform to equation (??) without solving it yields this dependence. 

1.4 Related leader election algorithms 
Leader election algorithm in network of fixed size 

Consider a simple algorithm for leader election algorithm in the context of communication network; 
at each level, the probability p for a station to send its message depends on the number n of stations 
remaining in the elimination process; p — 1 /n. Expected run time is 0(1) but it is clear that is necessary 
to know the number of active stations in advance, or at least to estimate it. See |Willard| ( |1984| ) for an 
estimation procedure in order of log log n + 0{l/n). This variant of the basic leader election algorithm 
does not exhibit an oscillating behavior any more. In fact, the average algorithm cost is asymptotically 



equivalent to a some constant L. For more details, see Lavault and Louchard (2005 ). 



/.Z77 data compression Scheme 

Consider a variant of the leader election algorithm by introducing a moderator who determines the elim- 
ination process; each of participants and the moderator throws ind ependently a coin and only tho se who 
obtain the sa me result as the m oderator continue the process. See Ward and Szpankowski (2004) for the 
biased case, ^rodingei ( 1993 ) for the unbiased one. Let Af„ the number of participants remaining in the 



last nontrivial round from an initial group of n items. It is asymptotically equivalent to the multiplicity of 
phrases in the LZ'77 data compression scheme. 

1.5 Overview 



In a previous paper on splitting algorithms, Mohamed and Robert (2005) proposed a direct approach 



based on a probabilistic reformulation of a basic functional equation associated to such algorithms. The 
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purpose of this work is to apply the techniques used by VIohamed and Robert (2005) to analyze an 
additive quantity in the context of an incomplete tree structure. In Section a similar series formula 
for the average cost E{Hn) is given by Proposition ^ The asymptotic behavior of the algorithm is 
studied in Section || and reformulated on the behavior of some stopping time r. Theorem |l] presents a 
new representation of the asymptotic oscillations of the algorithm. In Section Q the distribution of the 
algorithm cost is investigated. Using the binary decomposition of the interval [0, 1], the exact expression 
of t he distribution of Hn is es t ablish ed. Proposition ^ is a slight variation of the asymptotic formula given 



by panson and Szpankowski| ( |1997[ ) for the distribution of the algorithm cost Hn in the biased case. 

1.6 Notations 

Throughout this paper, (t„)„>i is a non decreasing random variables sequence such that 

• ti follows an exponential distribution with parameter 1, 

• {tn+i — tn) is a sequence of i.i.d. random variables exponentially distributed with parameter 1. 
For X > 0, let A/^ be the number of tn in the interval [0, x]. It is a i. v. with Poisson distribution . 

2 Average Cost of The Algorithm 

2. 1 Algorithm cost 

The algorithm cost is the number of steps needed to find a leader, or equivalently the height of the associ- 
ated tree. Denote by iJ„ this quantity when the size of the initial group of items is n, then, for n >2, this 
random variable verifies a recurrence relation; 

Hn ■ 1 + Hi^s„ l{S„7tO} + H2,n 1{S„=0}! 

with the boundary conditions Hq — Hi — 0, where {Bi{p))i<i<n are n independent Bernoulli variables 
of parameter p, 

n 

Sn ^ ^Bj(p), 

1=1 

for {m,n) e N^, Hi ^ and if2,n are independent and, for i — 1,2, the variable Hi^m has the same 
distribution as H„i. So, for n > 0, the recurrence equation for the sequence {Hn) can be rewritten 

Hn ■ 1 + Hs„ + Hn 1(5^=0) ~ l{n<l}- (2) 

2.2 Poissonization 

Consider the Poisson model, i.e. the size of the initial group of items is random following a Poisson 
process Afx of intensity 1 on the interval [0, x]. The following proposition gives a useful representation of 
the Poisson transform of the average cost of the algorithm. 

Proposition 1 Forx>0, 



/ + 00 ^ \ 

\i=0 J 
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where {Aj , Bj ) is a sequence ofi.i.d. realizations of a couple of random variable {A^ B) with distribution 

P(A = p, B = 0) = p, P(A = q,B =p) = q, 
ttq = 1, ao = and, for i > 1, 

i-l i-1 
j=0 j=0 

Proof: Let h the Poisson transform of the average cost (see Definition |2|). Then, the recurrence equation 
(H) for the sequence (Hn)n>o becomes 

h{x) = h{px) + h{qx) e-P"" + 1 - (1 + 2:)e~^. 

Following the approach of Mohamed and Robert ( |2005 ), direct iteration becomes possible using a prob- 
abilistic formulation of the last equation as below 

h(x)=E(^^^e-^^^+f{x), (3) 

where f{x) = 1 — (1 + x)e~^ and {A, B) is couple of random variables with distribution 

P(A = B = 0) = p, P(A = q,B =p) = q. 

Let the sequence of /./.t/ realizations {Ai, Bi)i(zfii of the couple of random variables {A,B). We introduce 
some notations; for a: > 0, Xq ~ x, Yq = 0, and for n G N, 

Xn+l ~ An Xn; Yn+1 — Bn Xn- 

By iterations of equation (|]), one gets at the (n + If^ stage 

Since /i'(0) — and, almost surely, lim„_^+oo X^+i = 0, then, one obtains 



Kx) = E 7 (1 - (1 + ^Oe~"'") e-"'^ j 



where ttq = 1, ao = and, for i > 1, 

i-l i-l 

As the sequences [ai) and (a^ + tt^) are, almost surely, in the interval [0, 1], the function h can be 
represented as follows 

/ + 00 ^ \ 

h{x) = IE ^ — '^{ti>aiX ■t2<(ai+TTi)x} ■ (4) 

The proposition has been proved. □ 
^From now on, throughout the paper, we conserve the notations introduced in this proof. 
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The next step is the probabilistic de-Poissonization of (^) following the method of |Robert| ( p005[ ) to obtain 
the expression of the average cost E(7?„). 

Proposition 2 (Probabilistic representation of the average cost) For n >2, 

(!7i.„,C/2,„)-l 



1=0 

where, for Q < x < y < 1, T{x,y) = min {iy{x); fj,{y)) with 

= inf {i > 1 : Oi > x} , 
fi{y) = inf {i > 1 : + TTi < y} , 

and Ui^n is the ith smallest variables ofn independent, uniformly distributed random variables on [0, 1] 
independent of the sequence {Aj , Bj).y^. 

Proof: For a; > 0, by decomposing with respect to the number of points of the Poisson process [Afx) in 
the interval [0, x], one gets, for < a < 6 < 1, 

P(ii > ax ,t2 <bx) ^Y P(ii > ax , t2 < bx\Mx = n)¥{Mx = n). 

n=2 

For n>2, conditionally on the event {Mx — n], the couple of variables (ti, has the same distribution 
as the couple {xUi^n,xU2.n) of the two smallest random variables of n uniformly distributed random 
variables on [0, x]. So, we get the identity 

fp^ x" _ \ 

F{ti > ax, t2 <bx)=E ( l{t/i,„>a,(72.„<6}^e-"^ 1 . 

Due to the independence of the sequence {Ai, Bi) and {ti , 12), and using the Fubini's Theorem, one gets 



+00 / / +00 

r!=2 \ \j:=0 ' 



C/2,„<(a.+7r.)} ] ] 



The identification of the representation of the Poisson transform (see Definition g) ¥.{Hj^_^) and the last 
identity gives the following formula for n > 2 



/+00 ^ 

\^=0 



Since, almost surely, the sequence (Q!i)i>o is increasing to a random variable a e [0, 1] and the sequence 
{ai + 7Ti)i>o is decreasing to the same random variable, the following equality holds 

{i>0 : Ui,n > TT, , U2.n < tti + 71;} = [0, r(C/i,„ , f/a,™) - 1], 
where the hitting time r is defined as above. □ 
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3 Asymptotic Analysis of The Average Cost 

3. 1 Two random sequences and one hitting time 

It is clear that the key of the analysis of the asymptotic behavior of the algorithm is the hitting time 
r written on the two random sequences {ai)i>Q and (a^ + 7rj)j>o. Let (7^) the sequence of random 
variables defined by 

(7.),eN-b >0 : B,=p}. 

These are the moments of jump of the sequence {ai)i>Q and conversely the instants of stopping for the 
other one, (tt^ + ai)i>Q. It is clear that these moments can be recursively defined as below: 70 = Go and 
jn+i = 1 + 7n + Gn+1, wherc {Gn)n>o ^ scqucncc of i.i.di. v. with a geometric distribution Geo{q) 

F{Geo{q) = k) = qp^. 

So, it is easy to see that 

v{x) e {1 + 7, : i e N}, ^J.{y) ^ {1 + 7,; : i e N}. 

Using a discussion on the position of the hitting time r in comparison with the sequence 7, we establish 
the following lemma which will be proved in the Appendix || 

Lemma 1 



(y)] +(riogp(p(log,(y))y)] - [\og^{y)\) 

^\ — l{7o=Liogp(y)Jm=riogp(p(iogp(?/))?/)l} I 

^-*=i+riogp(p(iogp(y))a)l 



where Vl{x,y) = {(a;,?;) € (]0,1[)^ : [logp(i/)] = \\ogp{xy\} and p is a periodic function with magni- 
tude 1 defined for z > by 

1 

p{z) = , {z} — z — \z \ is the fractional part of z . 

1-p 

3.2 Asymptotic fluctuations phenomena 

Theorem 1 (Asymptotic behavior of the average cost) The average cost E(i7„) admits the following 
asymptotic formula 

E(ff„) = - logp(7i) + E (riogp(t2)l ) + F(logp(n)) + 7^(n), 

where F is a periodic function defined for all z > by 

f°° r 1 / 1 _ „i-{i°gp y-z} \ 
Fiz) = J y(l-/-{'°Sp ^-->) I [logp( i^j— ) + logpy-zl - [logpy-zj j e-^dy, 

(5) 
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i7„ — n{Ui^n, U2,n) ond TZ{n) is a rest discussed in Section defined by 

I t(;7i,„,C/2..)-1 ^ . 

7l(n)=E ( ^ — )l{7o=Liogp(c/2,„)Jm=riogp(p(c/2,„)c/2,„)l} ) • (6) 

\ J=l+riogp(p(logp(C/2,„));72,„)l * 

Proof: Using Lemma one gets 

E(iJ„) = E(riogp(t/2,0]) +E((ri0gp(p(logp([/2,„))(72,„)l - Ll0gp((72,n)J)lfl„) 

/ T(;7i,„,t/2.„)-i _^ 

+ ^ ( XI — )l{70=Llogp(C/2,„)J;7i = riogp(p(C/2,„);72,,01} If^n 

\ «=l+riogp(p(logp(2/))a)l * 

The only not neglect terms are 

ri(n) = E (riogp([/2,„)l) and r2(n) = E (([logp(p(logp(C/2,„))f/2^„)l " \_^ogp{U2^n)\)ln^) ■ 

As n goes to infinity, nU2.n converges in distribution to a random variable t2 which is a sum of two i.i.d. 
exponential random variables with parameter l.Then, the first term satisfies 

ri(n) = E(riogp(i2) - logp(n)]) + O(^). 
Let T), function of — logp(n), the difference 

P(-logp(n)) =E(riogp(i2) -logp(n)l) - (E( riogp(t2)1 ) - log,(n)) 

It is easy to check that 'D{z) = — [zj , then lim„^+oo n'D{— log^ n) = lim^^+oo P^^ = 0, 

and one gets 

Ti(n) = -logp(n) +E(riogp(t2)l) +0(-). 
The last term 72 (n) is asymptotically equivalent to F(logp{n)) where F is defined by ^. In fact 

|F(logp(n))-r2(n)| < / (1 - / lQg^{p(^\ogp(y/n)))ye-ydy+-F{\ogp{nj)+2e- 

Jo n- Jn 

Observe that 

^oc poo 

logp(p(logj,(y/n))) y e'^dy = logp(p(logp y))y e'^^ydy. 

By decomposition on the sequence of intervals ( , p'^\ ) , the last integral is dominated by a geometric 
sum and the following inequality holds for n > 2 

logp(p(logp(2/))) y e-^^ydy < f-^. 

Then, 

F(logp(n)) -E((riogp(p(logp([/2,„)){/2,„)l - Llog,(f/2,„)J)lo„) = 0{^). 
This ends the proof. □ 
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3.3 Estimation of the rest 

The final step is to estimate the rest TZ{n) defined by (^). For x,y ^ [0,1], K > k > 
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E 



i=K 



where 5 = min(p, q). The following result is admitted. 
Conjecture 1 The hitting time r satisfies 



sup E f < oo. 



Ee[o,i] 



Remark 1 Conjecture ^ is an intuitive restriction on the exponential moment of the hitting time r. It is 
supported by some simulations (Fig^^ of x :— > E {{-^ Y^^'^'^^ using Monte-Carlo techniques. Observe 
that, for the unbiased case (Fig^, the maximum corresponds to numerical values of x around 0.5 which 
is, on average, the limit a of the two random sequences (a^) and (a^ + TTj). This maximum is of order of 
10^^, which is reasonable since it implies that 

E(r) < 14 log4(10) w 23.25 

For the biased one (Fig^, since 6 — 0.2, a maximum of the order o/lO^" is acceptable; E(r) < 57.22. 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



Fig. 2: Unbiased case: simulations of a; :— > E ^4^(^'^)^ . 
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0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 



(a) p = 0.2, & = p 



(b) p = 0.8, S = l-p 



Fig. 3: Biased case: simulations of x E 



Since, for < a; < y < 1, t{x, y) < max(T(a;, a;), t(j/, y)), then, using Conjecture H we obtain 

n{n) < CE ((1 - 5)riogp(p(c/2.„)c/2.„)l^ ^ 



where C — sy^'9xe['d,i] yE Using the same method as for the function F, one gets 

logp(l-'5) 



E ((1 - 5)ri°gp(p(c/2,„)c/2,„)l^ < E ((1 - <5)ri°gp(c^2,„)l) ^ (^i^ 



This gives 



Conclusion 



E(ff„ 



^(") = 0{-, — -— ^). 

logp(n) + E (Llogp(t2)J) + F(logp(n)) + 0(^-^ 



(i-<5)- 



4 Algorithm Cost Distribution 

It is more appropriate to use these notations po ^ Vi Pi = to define recursively the sequence of intervals 
(/^') associated to the binary decomposition of the interval [0, 1] in the base {pq, pi) 



-'o 



[0,1] 



j-n+l ( Tn+1\ I jn 

'fc - 14-1 j+ + Pfe-2Lfe/2j -'Lfc/2J' 

where (/) , denotes the right extremity of the interval /. Let |/| the length of the interval /, then 
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Let n G N and < fc < 2"+^ — 1. Consider the binary decomposition of k at the stage n 

k^ao + ai2 + ... + a„2", for < i < n, Oi £ {0, 1}. 
Then, the length of the interval /^'^^ is 

n 

For G N, a; > 0, one gets the following identity 

{Hj^^ >k}^{3Q<i<2^ : M{xl^) = . . . = M{xlti) = , M{xl^) > 2}. 

So 

2*=-! 

HHm.^ <k)^e-^ + x ^ |/^|e~(^'')+. (7) 
Let us define the sequence of probability measures (^fc) by 

i^k{t)= E 

Then, equation can be rewritten as 

r-l 



¥{Hj^^ < k) ^e-"" +x / e"^* rf^feW- (8) 
Jo 

Using a probabilistic de-Poissonization of equation as done for Proposition ^, we obtain the exact 
distribution of 77„. 

Proposition 3 Forn>2, 



Jo 

where the probability measure /ife is described as above. 
Using this identity 

1 - n(l - = 1 - nt (1 - - n(l - t)", 

the following result is immediate. 
Corollary 1 For A: G N, 

P{Hn > k) ^ V(U2,n < t) dfj,k{t) , CIS Ti goes to infinity. 
Jo 

where [/2,n the second smallest random variable of n uniformly distributed random variables on [0.1] 
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5 Appendix 

We present the proof of Lemma |l|. Recall the sequence of random variables (7^) defined by 

(7»),6N-{J>0 : B,^p}. 

Proof of Lemma |l|: First, note that 

(i.(x)>2 + 7o) ^ (70 > Llogp(x)J) 

(M(y)<7o) ^ (70 > riogp(2/)l). 

Denote by ilg the following set 

r!o = ^{x.y) := {(x,2/) G (]0,1[)2 : [log^Cj/)! = riogp(x)l}. 
By decomposing the function $ with respect to ilo, one gets this formula 

/T(x,y)-l \ ( ^° I \ Llogp(2/)J ^ U(.y)-1 ^ 

y 1=0 *y \i=o ^ / 4=0 ^ y i=o ^ 

/ T{x,y)-1 ^ 
+ E rT)l{70=Llogpfa)J} I If^o- 



-{7n>riogpfe)l} 



*=riogp(s/)i 



Since 



M(y)l(7o > riogp(2/)l) = inf{^ > < y} = flogpCy)] 
then, by simple calculations, one gets 

(r{x,y)-l ^\ / T{x,y)-1 ^ 

E 7 N riogp(2/)i +IE ( E 7)i{7o=Liog,fe)j} I i^^o- 

A second discussion on 71 implies that, on the set (70 = [logp{y)\ , fto), 

{v{x) > 2 + 71) ^ (71 > [logp(p(logp(a:))a:)l) 

(M(y) < 71) ^ (71 > 1 + riogp(p(logp(2/))2/)l), 

where p is a periodic function with magnitude 1 defined by p{z) = (1 — p^^^'^^)/ (1 — p). Moreover, p is 
decreasing on [0, 1[, so on the set 

p{^ogp{x))x < p(\ogpiy))y. 
LetfJi = fl{p{\ogp{x))x, p{\ogp{y))y). Then 

/ T{x,y)~l \ 

^ E - = riogp(y)l + (riogp(p(logp(y))y)l-Llogp(y)J)l^i(.,y) 



?=0 



^-i=l+riogj,(p(logp(y))a)l 



^ {70=[logp(y)J;7i=[logp(p(logp(y))y)]} | -'-□(s.y)- 
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This ends the proof. □ 
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